Exact and approximate discrete optimization algorithms for finding useful disjunctions of categorical predicates in data analysis
نویسندگان
چکیده
We discuss a discrete optimization problem that arises in data analysis from the binarization of categorical attributes. It can be described as the maximization of a function F (l1(x), l2(x)), where l1(x) and l2(x) are linear functions of binary variables x ∈ {0, 1}n, and F : R2 −→ R. Though this problem is NP-hard, in general, an optimal solution x∗ of it can be found, under some mild monotonicity conditions on F , in pseudo-polynomial time. We also present an approximation algorithm which finds an approximate binary solution x2, for any given 2 > 0, such that F (l1(x∗), l2(x∗)) − F (l1(x), l2(x)) < 2, at the cost of no more than O(n log n + 2C/ √ 2n) operations. Though in general C depends on the problem instance, for the problems arising from binarization of categorical variables it depends only on F , and for all functions considered we have C ≤ 1/√2. Acknowledgements: This research was partially supported by the National Science Foundation, Grant IIS-0118635, by the Office of Naval Research, Grant N00014-92-J-1375, and by the Rutgers Distributed Laboratory for Digital Libraries, a Strategic Opportunity Project of Rutgers, the State University of New Jersey.
منابع مشابه
PERFORMANCE COMPARISON OF CBO AND ECBO FOR LOCATION FINDING PROBLEMS
The p-median problem is one of the discrete optimization problem in location theory which aims to satisfy total demand with minimum cost. A high-level algorithmic approach can be specialized to solve optimization problem. In recent years, meta-heuristic methods have been applied to support the solution of Combinatorial Optimization Problems (COP). Collision Bodies Optimization algorithm (CBO) a...
متن کاملPareto-optimal Solutions for Multi-objective Optimal Control Problems using Hybrid IWO/PSO Algorithm
Heuristic optimization provides a robust and efficient approach for extracting approximate solutions of multi-objective problems because of their capability to evolve a set of non-dominated solutions distributed along the Pareto frontier. The convergence rate and suitable diversity of solutions are of great importance for multi-objective evolutionary algorithms. The focu...
متن کاملNon-Fourier heat conduction equation in a sphere; comparison of variational method and inverse Laplace transformation with exact solution
Small scale thermal devices, such as micro heater, have led researchers to consider more accurate models of heat in thermal systems. Moreover, biological applications of heat transfer such as simulation of temperature field in laser surgery is another pathway which urges us to re-examine thermal systems with modern ones. Non-Fourier heat transfer overcomes some shortcomings of Fourier heat tran...
متن کاملA novel technique for a class of singular boundary value problems
In this paper, Lagrange interpolation in Chebyshev-Gauss-Lobatto nodes is used to develop a procedure for finding discrete and continuous approximate solutions of a singular boundary value problem. At first, a continuous time optimization problem related to the original singular boundary value problem is proposed. Then, using the Chebyshev- Gauss-Lobatto nodes, we convert the continuous time op...
متن کاملTesting Soccer League Competition Algorithm in Comparison with Ten Popular Meta-heuristic Algorithms for Sizing Optimization of Truss Structures
Recently, many meta-heuristic algorithms are proposed for optimization of various problems. Some of them originally are presented for continuous optimization problems and some others are just applicable for discrete ones. In the literature, sizing optimization of truss structures is one of the discrete optimization problems which is solved by many meta-heuristic algorithms. In this paper, in or...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Discrete Applied Mathematics
دوره 144 شماره
صفحات -
تاریخ انتشار 2004